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(54) Title: MICROBIAL PROTEIN EXPRESSION SYSTEM 
(57) Abstract 

The present invention provides bacterial strains for secretion of soluble biologically active recombinant heterologous proteins into 
periplasm or on a surface/into a cultivation medium of bacteria. The invention exploits the secretion system of Gram-negative bacteria 
including periplasmic chaperones and ushei/secretin proteins. For accomplishing the aim, the bacterial strains simultaneously express 
the fusion protein (signal peptide)-(matuTe heterologous protein)-<subunit of a bacterial surface structure, CafI), periplasmic chaperone 
specific for the subunit, and outer membrane usher/secretin protein specific for the subunit Secretion of fusion proteins: (signal peptide 
of CaflHmaturc human IL-l^)-(maturc Cafl), (signal peptide of Cafl)-(mature human GM-CSFHmanire Cafl). and (signal peptide of 
CaflHntature human II^lra)-(mature Caft) that were expressed in Escherichia coli simultaneously with the periplasmic chaperone CaflM 
and die usher/secretin protein Cafl A are examples of the use of the invention. 
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MICROBIAL PROTEIN EXPRESSION SYSTEM 



5 FIELD OF INVENTION 

This invention is related to biotechnology and more specifically to production of 
recombinant heterologous proteins by miCTobes. In particular, this invention 
concerns secretin of soluble biologically active heterologous proteins into 
10 perqjlasm and/or on a surfece/into a cultivation medium of Gram-negative 
bacteria. The invention exploits the secretion system of Gram-negative bacteria 
including periplasmic chaperones and usher/secretin proteins of the system. 

BACKGROUND OF INVENTION 

15 

Commercial production of various medically and industrially valuable 
recombinant proteins by microbes is one the key challei^es of modem 
biotechnology. Even though such systems are known, there are severe technical 
problems which are encountered within large-scale exploitation of microbial cell 
20 machinery. There are several secretion systems in Gram-negative bacteria which 
can be potentially exploited for secretion of recombinant heterologous proteins. 
The systems are briefly reviewed below. 

1. Secretion across the inner membrane 

25 

The majority of secreted proteins in Escherichia coli are synthesized as precursors 
with a classic N-terminal signal peptide (SP) which is essential for efficient export 
and which is cleaved during or following translocation across the inner membrane. 
Translocation is mediated by the Sec translocase (SecA/Y/G/E). SecA is a 
30 peripheraUy associated ATPase, which interacts with the signal sequence and 
mature part of the precursor, guides the polypeptide into the translocator and 
provides energy for the process. Sec Y, E and G are integral membrane proteins 
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that form the translocator itself and a central aqueous channel through which the 
polypeptide is translocated. SecD and F have large periplasmic domains. Proposed 
functions of these two proteins are related to later steps in the process and have 
included release from the membrane and mediation of transfer of energy from the 

5 proton motive force. Polypeptides are translocated in an ^unfolded' state. Hence 
SecB is a cytosoUc chaperone apparently dedicated to the secretion pathway which 
is required for export of a subset of secreted proteins. SecB both inhibits premature 
folding and targets the precursor to the membrane translocase conq)lex. It now 
appears that basic principles of the export system are universal Although 

10 translocation is primarily co-translational in eukaryotic systems and targeting to 
the secretion apparatus is primarily via SRP (signal recognition particle), the core 
translocator is homologous in both systems (yeast Sec 61 a and y are homologues 
of E. coli ^YfE). Also, E coliffh and 4.5S RNA are homologues of eukaryotic 
54 KD subunit and 7S RNA of SRP, respectively. Comparison of the eukaryotic 

15 and prokaiyotic systems has been extensively reviewed recently (Rapoport, T., et 
al. (1996)Annual Review of Biochemistry. 65:271-303; Schatz, G., and B. 
Dobberstein (1996)Science. 271:1519-1526). 

Similarities in the basic function of eukaryotic and bacterial export systems have 
20 meant that some mammalian proteins can be successfully secreted to the periplasm 
of E. coli. Examples include human insulin. Often, however, fine tuning is 
required such as optimising the N-terminus of the mature protein, removal of 
positively charged residues at the end of the SP or beginning of the mature protein, 
ensuring presence of a good cleavage site. Frequently, a bacterial signal peptide 
25 such as the OmpA SP has been used- The Cafl signal sequence has also been 
successfully used to export mammalian cytokines (see below). A major problem 
on recombinant expiesswn in E coli is inconrect folding with accompanying 
protein degradation or accumulation in an insoluble and inactive form as inclusion 
bodies. 



30 



In addition to the sec dependent secretion system there are at least two other 
systems of protein transtocation acK)ss the bacterial inner membrane. The Ml 3 
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phage coat protein is also synthesised with an additional SP, but assembly of this 
protein across the membrane is independent of the sec machinery. Recently, a 
novel pathway involved in secretion of cofector-containing protans has been 
elucidated (Santini, G., et al. (1998) EMBO Journal. 17:101-112; Weiner, J., et al. 
; (1998) Cell. 93:93-101). Proteins following this pathway have a long leader 
containing a characteristic 'twin arginine' motif. It is proposed that co&ctor 
attachment occurs in the cytosol and that the folly folded protein is translocated 
across the imier membrane via products of the mtiABC operon. In addition, there 
are cytosoUc proteins ofE. coU which appear to be localised in a priviledged site 
D which is sensitive to osmotic shock. Therefore may have some transient access to 
the periplasm Such proteins include thioredoxin (Lunn, C.A., and V.P. Pigiet. 
(1982) J.Biol.Chem 257:11424-11430) involved in disulphide reduction of 
ceUular components, the cytosoHc chaperone DnaK (Yaagoubi, A.,et al. (1994) 
Journal of Bacteriology. 176:7074-7078), elongation fector Tu (Jacobson, G., et al. 
5 (1976) Biod»emistry. 15:2297-2303) and inner membrane bound components of 
enterobactin synthase complex (Hantash, F., et al. (1997) Microbiology. 143:147- 
156), and capsule assembly (Rigg, G. et al. (1998) Microbiology. 144:2905-2914). 
It has been suggested that this 'compartment' may be related to transient formation 
of adhesion zones between the bacterial imier and outer membranes, but nothing is 
20 known regarding properties of the protein which targets them to this location nor 
about the physical nature of this 'compartment'. A nmnber of 'cytosoUC 
recombinant proteins (i.e. without SP) also behave in a similar mamier and are thus 
presumably targeted to the same cellular location. These include GST fiision 
proteins, interleukin ip (Joseph-Liauzun, E., et al. (1990) Gene. 86:291-295). 

25 

2. Extracelhdar secretion systems 

Six different pathways for export of extracellular proteins have been identified in 
Gram negative bacteria. Each pathway has been identified in a diverse range of 
30 bacteria. The basic properties of these systems are summarised in Fig 1 (recently 
reviewed by Lory (Lory. S. (1998) Current Opinion in Microbiology. 1 :27-35). 
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The Type IL pathway, which is considered to be the main terminal branch of the 
sec- dependent pathway, is used for export of many different unrelated soluble 
proteins. It involves a folded periplasmic intermediate and requires approximately 
12 dedicated genes for export across the OM. Alternative terminal branches to the 

5 sec pathway include specific chaperone-dependent fimbriae assembly and the OM 
helper pathway. The former pathway also involves a periplasmic intermediate (at 
least partially folded) but in this case the secreted polypeptide is specifically 
transported in association with its own chaperone/ outer membrane usher protein 
system. Outer membrane helpers fold into the outer membrane with concomitant 

10 exposure of the effector domain at the cell surface and, in the case of IgA protease, 
release via self-hydrolysis. Interaction with general periplasmic chaperones e.g. 
DsbA has been demonstrated as a critical step m secretion pathway for a number 
of sec dependent proteins. 

15 Type I, Type HI, and most members of Type IV pathways are sec independent and 
mediate secretion of a specific protein (subset of proteins or DNA (Type IV) 
directly from the cytosol. Type I resuhs in secretion into the external media, 
whereas Type HI targets the secreted protein directly into the eukaryotic ceU 
foUowing contact-stimulated activation of the secretion system. The Type HI 

20 pathway also shares many features with flagellar assembly systems. 

3. Secretion of recombinant heterologous proteins in Gram-negative bacteria 

Incorrect folding of proteins in the cytosol may lead to degradation or formation of 
25 misfolded protein as inclusion bodies. In many instances, therefore, it is desirable 
to have heterologous expression of recombinant proteins in the bacterial periplasm, 
at the cell surfece, or in the extracellular media, permitting correct folding and 
formation of a fimctional product. Proteins secreted to the periplasm of £. coli are 
in an oxidising environment, compared to the reducing environment of the cytosol. 
30 The periplasm contains oxidoreductases and chaperones (disulphide bond 
isomerase, DsbA and C, peptidyl prolyl cis-transisomerase, RotA, SurA, and 
FkpA) which are essential for the correct folding of proteins (Missiakas, D., and 
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S.Raina. (1997) Journal of Bacteriology 179:2465-2471). In addition, recombinant 
proteins expressed in the periplasm or secreted to the extracellular medium would 
represent a high percentage of the final protein content of these respective 
compartments. Thus, when the final goal is to obtain a purified recombinant 

5 product, secretion of the product to the periplasm or extemaUy should greatly 
facilitate purification protocols. Although there are quite a few systems available 
for perqjlasmic localisation of protems, there is no major system for secretion of 
extracellular products firom £. coli. Over the past decade there has also been a 
great deal of interest in expressing proteins and peptides on the surfece of 

10 microorganisms. Phage display technology (Winter, G., et aL (1994) Annual 
Review of Immunology 12:433-455) utilises the coat protein of filamentous 
bacteriophage for surface display of proteins or peptides. Such technology has 
been applied to the isolation of specific antibody fi-agments and for the rapid 
identification of peptide ligands. Interest in surfece display in E. coli (Georgiou, 

15 G., et al. (1993) Trends m Biotechnology. 11:6-10) and other Gram negative 
bacteria has centered around identification of protective epitopes and their 
applications as live vaccines, production of bacterial adsorbents and whole-cell 
biocatalysts. 

20 Although there has been some success in expressing of proteins, there are a 
number of limitations within the existing systems as outlined below. 

Most secretory/ assembly pathways of E. coli have been investigated for their 
potential exploitation as secretion vehicles for heterologous proteins. These 
25 include systems that direct the protein to the periplasm, cell surface or extraceUular 
medium. 

3.1 SP alone 

30 A number of expression vectors use a bacterial SP (often that of the OM protein 
OmpA) to mediate export across the inner membrane. Destination of the protein 
depends on the nature of the protein itself It is not uncommon for proteins 
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exported in this way in high levels to form insoluble complexes, inclusion bodies, 
in the periplasm as a result of incomplete folding. 

3.2. Ajfmity purification systems 

5 

Fusion expression systems have been developed to fecilitate downstream 
purification of recombinant products. Examples include insertion of a His tag for 
purification on a Nickel column (Clonlech, Qiagen, In vitrogen); fusion to MalE 
(New England Biolabs), maltose binding protein, with subsequent purification on 

10 an amylose column; thioredoxin fiisions with PAO (phenyl arsine oxide) resin and 
chitin binding domain fusions with chitin columns (New England Biolabs). By 
inclusbn or omission of SP in the vector, some of these systems (e.g. MalE, His 
Tag) can be adapted for periplasmic or cytosolic expression, respectively. In 
general, such vectors contain a highly specific protease cleavage site for 

15 downstream purification of the product. Fusions functional in both domains, e.g. 
MalE and secreted domain, can be obtained. This, however, is dependent on the 
nature of the protein. The carrier domain may interfere with folding of the 
recombinant protein resuhing in protein degradation, insolubility of the protein due 
to membrane association or formation of insoluble inclusion bodies at higher 

20 concentrations. 

3.3. Surface display in E. coli 

Insertion of epitopes into major OM proteins (OmpA. LamB, PhoE), flagella, 
25 fimbriae. These systems involve insertion of epitopes into a permissive site, i.e. 
surfece loop within OM proteins or flagellar, fimbriae subunits, without affecting 
assembly of the membrane protein or surfece appendage. In general there are 
severe size restrictions of the insert (10-60 amino acids) to avoid effects on 
folding and assembly of the protein. There are reports of surfece display of whole 
30 proteins by preparing terminal fiisions to part of the outer membrane protein, 
OmpA or of IgA protease. Using a Lpp- OmpA vector, complete enzymes have 
been localised to the surface off. coli offering the potential of surface display, but 
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these constructs lead to disruption of the outer membrane with concomitant 
toxicity to the ceU and leakage of periplasmic contents. In addition, the fusion 
proteins follow the outer membrane protein assembly pathway. This limits the 
maximum number of surface molecules and more importantly it is evident that 
con?)letely folded proteins possessing disulphide bonds cannot be assembled 
across the outer membrane by this route (Klauser, T., et al. (1990) EMBO Journal. 
9:1991-1999; Stathopoulos. C, et al. (19%) Applied Microbiology and 
Biotechnology. 45:112-119). 



10 3.4. Extracellulco' secretion. 

There have been limited reports on extracellular secretion of unrelated proteins by 
some of the above mentioned secretion pathways. The Hly Type I secretion 
pathway has been adapted to delivery of heterologous antigens (Gentschev, I, et al. 
15 (1996) Graie 179:133-140). Althor^h apparently successfiil, this system delivers 
proteins directly from the cytosol and would preclude any protein which require 
exposure to the periplasmic space for correct folding, e.g. disulphide bond 
formation. 



20 



It is summarised below some of the serious drawbacks associated with 
recombinant protein expression: 



(i) Periplasmic expression systems: Many heterologous polypeptides expressed in 
E. coli are either degraded or form aggregates and inclusion bodies as a result of 

25 incorrect folding. This may occur despite targeting of the protein to a preferred 
location, i.e. the cytosol (with a more reducing environment) or the periplasm 
(with a more oxidising environment and specific chaperones involved in folding). 
Employment of a signal sequence to proteins targeted to the periplasm results in 
varying degrees of efficiency of precursor processing, completion of translocation 

30 and correct folding. Some incorrectly folded proteins remain associated with the 
inner membrane and mduce toxicity. In addition, they are extensively degraded 
resulting in a poor yield. Others accumulate in a non-native conformation as 
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20 



25 



insoluble aggregates. Systems employing fusion proteins are available. These may 
to some degree enhance solubility of some recombinant proteins but others remain 
insoluble due to incomplete folding of the heterologous domain. A system leading 
to stimulation of the early folding event following translocation across the inner 
membrane would clearly enable periplasmic expression of many heterologous 
polypeptides which have thus fer eluded successful expression in E. coli. 

(ii) Surfece localisation in Gram negative bacteria: GeneraUy. there is a strict 
limitation m the size of epitopes which can be expressed at the cell surface using 
proven surfece expression vectors. Systems that permit surface expression of 
whole domains or proteins by fusion them to an outer membrane protein lead to 
membrane permeabilisation. periplasmic leakage and toxicity. In addition, there 
are Umitations on the extent to which proteins can be folded if they are to be 
exported by this pathway. Fmally, as these systems all use integral membrane 
15 proteins, they are limited with respect to the maximum expression level and would 
be very laborious to purify. 

SUMMARY OF THE INVENTION 



30 



The present invention provides bacterial strains for secretion of soluble 
biologically active recombinant heterologous proteins into periplasm or on a 
surface/into a cultivation medium of bacteria. The invention exploits the secretion 
system of Gram-negative bacteria including periplasmic chaperones and 
usher/secretin proteins. For accomplishing the aim, the bacterial strains 
simultaneously express the fiision protein (signal peptide)-(mature heterologous 
proteinHsubunit of a bacterial surface structure, Cafl), periplasmic chaperone 
specific for the subunil, and outer membrane usher/secretin protein specific for the 
subunit. Secretion of fiision proteins: (signal peptide of CaflHtnature human IL- 
iPHmature Cafl). (signal peptide of Cafl)-(mature human GM-CSF)-(tnature 
Cafl). and (signal peptide of Cafl)-(mature human IL-lraHmature Cafl) that 
were expressed in Escherichia coli simultaneously with the periplasmic chaperone 
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CaflM and the usher/secretin protein CaflA are examples of the use of the 
inventioa 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. Extracellular secretion systems in Gram negative bacteria. Major 
components include: (i) Type I (e.g. HlyA) - inner membrane (IM) ABC 
transporter (Hy B and D) + an outer membrane (OM) protein (Tol C), no 
periplasmic intermediate detected; 00 Type n (e.g. pullulanase) - Sec system for 
translocation across IM; translocation across OM - Upul gene products including 
an OM secretin (PulC , S), eight IM pul products (Pul C, E,F, K-0) and four 
prepilin-like products (Pul G-J), evidence supports periplasmic intermediate; (Hi) 
Type m (e.g. Yops) - 24 Ysc proteins including an OM secretin (YscC) and IM 
ATPase (YscN), specific Syc cytosolic chs^wrones, and Yop B and D required for 
15 delivery to eukaryotic ceU (iv; Type W- newly classified group including systems 
involved in DNA transfer to plant or other bacterial cells (e.g. T-DNA of 
Agrobacterium tumefaciens, 11 virB genes, unlikely to go via periplasmic 
imermediate; and pertussis toxin export (9 ptl genes, periplasmic assembly of 
toxin likely), (v) OM helper (e.g. IgA protease) - Sec system for transtocation 
across IM; no specific accessory proteins; uses a 'helper domain' within the 
protein which presumably follows the OM protein assembly pathway and folds 
into the OM exposing the secreted domain on the surfece; (vi) Specific chaperone 
mediated assembly (e.g.Pap pili) - Sec system for translocation across IM; 
translocation across OM - periplasmic chaperone (PapD) which specifically 
25 recognises pilin subunits, OM usher/ secretin protein (PapC) homologous to PulC 
and YscC of Typell and TypeHI pathways, respectively. 

Fig. 2. Construction of hybrid genes coding for »PCafl-hIL-ip fusion proteins. 



20 
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Fig. 3. Expression of '"Cafl-hlL-lp fiision proteins. A. Coomassie Blue-stained 
SDS-PAGE of soluble (lanes 2-5) and insoluble (lanes 6-9) proteins from E. coli 
cells transformed withpKKmod (lanes 2, 6), pRKmod/'-'Cafl-hlL-ip (lanes 3, 7), 
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pKKmodf " •Cafl{-2)WL-lp (lanes 4, 8), and pKKinod/"'Cafl(+3)hIL-ip (lanes 5, 
9). UL-lp was loaded as a control (lanes 1, 10). B. Immunoblot of the same gel 
analysed with anti-hlL-lp rabbh polyclonal antibodies. Positions of uiqjrocessed 

(I) and processed (II) hIL-1 p are shown by arrows. 

5 

Fig. 4. Secretion of ^-PCafl-HL-ip fusion proteins. Immunoblots of periplasmic 
(A) and soluble cytoplasmic (B) proteins fiom ^PCafl(+3)hIL.lp (lane 1), 
^PCafI(-2)ML-ip (lane 2), and 'PCafl-hlL-lp (lane 3) expression strains. 
Corresponding proteins from cells harbouring pKXmod were used for comparison 
10 (lane 4). hIL-1 p was loaded as a control (lane 5). Proteins were analysed with anti- 
hlL-ip rabbit polycbnal antibodies. Positions of unprocessed G) and processed 

(II) hIL-ip are shown by arrows. 

Fig. 5. Trypsin digestion of permeabilized cells. Insohible proteins (lane 1-2) and 
15 soluble proteins (lane 3-4) were obtained from *'^Cafl(-2)hIL-ip expression ceUs 
before (lane 1,3) and after (lane 2,4) trypsin treatment. 

Fig. 6. Construction of pCIC plasmid coding for the * ''Cafl(-2)-HL-ip-Cafl 
fusion protein. 



20 



Fig. 7. The N-terminal sequence of mature CIC. After partial purification of 
periplasmic fractions on a DEAE-Sepharose CL-6B column (Pharmacia, Sweden) 
proteins were separated by SDS-PAGE followed by blotting onto a PVDF- 
membrane (Amersham, UK). The desired bands were excised and placed onto a 
25 polybrene coated and precycled glass fiber filter. Amino acid sequence analyses 
were performed with an Applied Biosystems model 477A protein sequencer 
equipped with on-line Applied Biosystems model 120A phenylthiohydantoin 
amino acid analyser. 



Fig. 8. CaflM facilitates expression of 'PCan(-2)-hIL-ip-Cafl (CIC). A. 
Coomassie Blue-stained SDS-PAGE of periplasmic fraction from cells 
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transformed with: pCIC (lane 1), pFMA (lane 2), pMA-CIC (lane 3), pA-CIC (lane 
4), pM-CIC (lane 5), pMA-PrCIC (lane 6), pA-PraC(lane 7), pM.PrCIC(lane 8), 
pCIC, pCaflM (lane 9), pCIC, and pCaflMA (lane 10). B. Immunoblot of the 
same gel analysed with anti-hIL-1 p rabbit polyclonal antibodies. 

5 

Fig. 9. Graphical representation of plasmids constructed for operon-like co- 
expression experiments with CaflM. CaflA, and CIC. pFMA and pCIC are given 
for con[q>arison. 

10 Fig. 10. Detection of CIC in periplasm with monoclonal anti-IL antibodies using 
ELISA. Detection of CIC in dilutions of a periplasm aUquot fiom control cells 
carrying pTrc99 with plasmid (triangles), from cells carrying pac (circles) and 
from cells carryii^ both pCIC and pCaflMA (squares). 

15 Fig. 11. Construction of pFGMFl and pFRF275 plasmids coding for the scaf- 
GMCSF-Cafl and scaf- ILlra -CMlfiision protein. 



Fig. 12. Fractionation of proteins expressed in JMlOl cells with plasmids 
pFGMFl (lanes 3,4,7,8), pFRF275 (lanes 1,2,5,6), and pCaflM (lanes 2,4,6,8). 
20 After induction cells were precipitated, resuspended in buffer with lysozyme and 
incubated on ice for 1 hour foUowed by sonication for 1 min. Sohible and 
insoluble proteins were separated by centrifugation. Lanes 1-4 - insoluble fraction, 
lanes 5-8 - soluble fraction. 

25 Fig. 13. Expression of scafl-GMCSF-Cafl fusion gene. A. SDS-PAGE analysis of 
periplasmic proteins obtained from cells harboring pFGMFl (lane 1), pFGMFl 
and pCaflM (lane 2), and pFGM13 (lane 3). B. Protein immonoblot analyzed with 
anti-GMCSF rabbit polyclonal antibodies. Lanes 1,2 - total cell proteins, lanes 3-5 
- periplasmic proteins. Protein fractions were obtained from cells harborbg 

30 pFGMFl (lane 1,4), pFGMFl and pCaflM (lane 2,3), and pFGM13 (lane 5). C. 
Immonoblot of periplasmic proteins analyzed with anti-Cafl rabbit polyclonal 
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antibodies. Cells produced GMCSF-Cafl fosion protein without (lane 1) or with 
(lane 2) the Cafl M chaperon. 

Fig. 14. Expression of scafl-ILlra-Cafl fusion gene. A. SDS-PAGE of 
5 periplasmic proteins obtained from cells harbouring pFRF275 alone (lane 1) or 
with pCaflM (lane 2). B. Immunoblot of the same gel analysed by anti-ILlra goat 
polyclonal antibodies. 

DETAILED DESCRIPTION OF THE INVENTION 



According to the present inveittion, secretion of proteins by gram-negative bacteria 
can be accomplished with adhesins via chaperone/usher pathway. The periplasmic 
chaperones of the system: 

15 • mediate the partioning of nascently translocated subunits out of the inner 
membrane and into the periplasm; 

• help in the folding of nascent subunits into a native confonnation; 

• protect the subunits from a proteolytic degradation. 

20 The periplasm of wild type strains of bacteria contains oxidoreductases and 
additional chaperones (disulphide bond isomerase, DsbA and C, peptidyl prolyl 
cis-transisomerase, RotA, SurA and FkpA) which are essential to the correct 
folding of proteins. 

25 The ca/operon is the simplest operon in comparison to the size of the operons 
dedicated to most of the other extraceUular secretion systems (Karlyshev, A.V. et 
al. (1994) in Biological Membranes: Struaure, Biogenesis and Dynamic. NATO- 
ASI Series, vol. H-82, Op den Kamp, J.A.F., ed., pp. 321-330, Springer-Verlag, 
Berlm). It comprises of caflR encoding a transcription regulator, the structural 

30 gene for the Cafl polypeptide (Galyov, E.E. et al. (1990) FEBS Lett. 277, 230- 
232) and two genes encoding products for specific secretion of Cafl - the CaflM 
periplasmic chaperone (Galyov, E.E. et aL (1991) FEBS Lett. 286,79-82) and 
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can A outer membrane protein (Karlyshev, A.V. et aL (1992) FEES Utt. 297, 77- 
80). 

Cafl polypeptide is sunthesised with a 20 aa cleavable SP. FoUowing translocation 
across the inner membrane (presumably via the sec pathway), CaflM bmds to 
mature Cafl and protects it from proteolytic degradation by assisting its folding 
and also its release from the im^er membrane (Zav'yalov, V. et al. (1997) 
Biocherr. J.. 324, 571-578; Chapman D. et al. (1999) J. Bacteriology, in press). 
Interaction of subunit with CaflM chaperone then stimulates a signal for 
interaction with the OM protein CaflA and CaflA translocates Cafl to the ceU 
surfece Surfece assembled Cafl forms an amorphous capsule-like stmcture on the 
surfece of the bacteria. Tim surfece structure can be readily recovered with the 
bacterial ceUs and washed off the surfece to give a relatively pure preparation of 
the Cafl protein. CaflM chaperone and Cafl A usher are specific for Cafl subumt 
and not suitable for a secretion of heterologous proteins from this point of view^ 
However, surprisingly it was found in the present invention that the fosions of 
three different heterologous proteins with the Cafl subunit are secreted in soluble 
biologically active form and protected from proteolytic degradation when 
expressed simultaneously with the CaflM chaperone. 

According to the present invention Caf system can be applied for: 

. production of recombinant heterologous proteins in the periplasm of bacteria in 
a soluble, biologically active conformation; 
5 . surfece display of whole recombinant heterologous proteins on Gram-negative 
bacteria - for obtaining heterologous live attenuated vaccines, construction 
whole cell ligands, whole ceU biocatalysts, purificaton of recombinant 
heterologous protein from cell surfece by washing fusion protein out of the cell 
surfece, or by proteolytic cleavage of fusion protein; 

surfece display of heterologous amino acid sequences (epitopes) wfthin Cafl 
subunit, and use as ligand, vaccme candidate etc. 
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Potential implications of the caf system: 

1. Caf ISP-export across the IM: As with many other secretory proteins the Cafl 
5 signal sequence can be used directly for secretion of proteins across the 
bacterial inner membrane. It contains no positively charged residues at its C- 
terminus that might interfere with export and can therefore be used directly. As 
the secreted protein would foUow the Sec pathway to the periplasm, the 
efficiency of export of the heterologous protein would also depend on the 
10 nature of the protein to be secreted. In particular, efficiency of secretion would 
depend on the N-terminus of the mature polypeptide, absence of long potential 
anchor sequences within the secreted protein, and absence of protein folding 
prior to export. The first two features can be addressed by altering the relevant 
sequences. 'Premature' folding is more difficuh to control although excess of 
15 cytosoUc SecB, GroEL, DnaK chaperone may delay folding in some instances. 
Folbwing transtocation across the IM and SP cleavage, release of the 
heterologous protein from the IM and folding of the protein would again be 
dependent on the nature of the heterotogous protein itself. 

2. Caf] M chaperoning in the periplasm: The cq^ system has the added advantage 
20 that it includes a periplasmic chaperone which specifically binds to the Cafl 

subunit. CaflM prevents degradation of the Cafl subunit, probably via 
enhanced folding and release from the IM. Chaperone recognition is via the C- 
terminus of the Cafl subunit although interaction with other parts of the Cafl 
subunit may also be required for high afifmity binding. Hence, fusion of the 
25 Cafl subunit C-terminally to the protein destined for release may stimulate 
folding of the heterologous protein, aid release of the protein from the IM, 
increase its solubility and prevent aggregation (formation of periplasmic 
inclusion bodies) and proteolytic degradation. 

3. CqflA - export of heterologous proteins extracellularly across the outer 
30 membrane. CaflM chaperone also targets Cafl to the OM secretin CaflA. 

Following interaction of the complex with CaflA secretin, the Cafl is 
translocated across the outer membrane and forms a large polymeric structure 
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on the cell surfece -anchored via CaflA. For the case that the anchoring is 
dis&vored a proper mutant of Cafl can be prepared. Thus, inclusion of CaflA 
in the ejqjiession vector would permit targeting of the recombinant protein to 
the Cafl A secretin. 

5 4. Hybrid proteins exported by this system could be proteolytically cleaved to 
remove Cafl by introduction of an appropriate cleavage site between Cafl and 
the protein of interest For proteins arrayed on the ceU surface of E. coli this 
provides an enormously efBcient method of purification from isolated cells. 
Cell surface localised hybrid could also be used for vaccine preparation, ligand 

10 binding or as whole cell biocatalyst. Proteins secreted from the cell or 
recovered from the periplasm would be already in a highly purified form. All 
proteins secreted by this system would be exposed to the periplasm, offering the 
nrilieu for correct folding of secreted proteins. Using this system, the 
probability of folding into a fanctional protein/ domain would be fiirther 

15 increased for some fiision proteins due to interaction with the CaflM 
chaperone. 

In addition to secretion of whole proteins or domains as an N-terminal fiision to 
the Cafl subunit (or possibly smaller domain thereof), it will undoubtedly also 
20 prove to be the case that short epitopes can be inserted into permissive sites within 
the Cafl subunit permitting surface display of epitopes. Similarly, short peptides 
can be inserted into the FGL site of the CaflM chaperone. 

Summary of potential advantages of the cqf system as applied to heterologous 
25 secretion n Gram negative bacteria: 

• despite the feet that protein hIL-ip is insoluble in the periplasm and a fosion 
hIL-lp-Cafl is also insoluble, co-expression with the CaflM chaperone 
surprisingly led to a sohible product. Hence, the CaflM chaperone catalyses 
30 folding, not only of the carrier Cafl domain, but also of the IL-lp domain, 
resulting in release from the membrane and formation of a processed, sohible 
product. Because of correct folding the heterologous fiision protein is not 
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degraded and accumulates in high levels. I.e. CaflM can stimulate folding, 
membrane release and solubility of heterologous polypeptides expressed as 
CqP fusion proteins; 

• while available systems of surfece display of whole proteins in E. coli result in 
5 problems of toss of integrity of the outer membrane and have limitations as to 

the extent of folding of the exported protein, the caf system has not these 
limitations. The fusion protein is anchored to the ceU surface or released as a 
sohible protein without disrupting the membrane. Similarly, assembly does not 
follow the folding pathway of outer membrane proteins and a folly folded 
10 fusion protein is exported; 

• option to readily express high levels of recombinant protein in periplasm 
(CaflM atone) or cell surface/ media (CaflM + Cafl) using the same system; 

• ready purification from the cell surfece by proteolytic cleavage or recovery in 
high levels from periplasmic fractions (whichever preferred); 

15 • exquisitely simple compared to other potential extracellular secretion vectors- 
employing only a periplasmic chaperone (CaflM) and outer membrane protein 
(CaflA); 

• ability to secrete extracellularly a whole protein or part thereof 

. transient or permanent periplasmic localisation of fiiston protein permits correct 
20 folding and oxidation of secretory proteins. 



The invention is fiirther iUustrated here below with specific non-limiting examples. 
The examples describe the secretton of the fusion proteins (signal peptide of 
25 Cafl)-(mature human IL-lp)-(mature Cafl), (signal peptide of Cafl)-(mature GM- 
CSF)-(mature human Cafl) and (signal peptide of Cafl)-(mature human IL-lra)- 
(mature Cafl) expressed in Escherichia coli simultaneously with the periplasmic 
chaperone CaflM and the usher/secretm protein CaflA as the examples of the use 
of the system. Even though Examples describe secretion of 3 proteins it is obvious 
that any other protein can be secreted using this system. In additton, although 
E.coli is used as the host microbe containing the expression system in these 
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examples, it is obvious that also other cells having proper periplasmic space for 
accommodating the caf expression system can be used. 

EXAMPLE 1 

5 Fusion of hlL-lp with the signal peptide of the Cafl protein. 

The Cafl signal peptide C^Can) was fiised with WL-ip in three variants (Fig.2). 
1. ' " Cafl-hlL-ip was the straight fosion where the last amino acid of the Cafl 
signal peptide was joint to the first amino acid of hIL-lp. 2. 'P-Cafl(-2).hIL-lp 
10 was the straight fiision with a mutation in the Cafl signal peptide Asn(-2)Asp. 3. 
*^''Cafl(+3)hIL-lp was the fusion containing in the joint region three N-terminal 
amino acids from mature Cafl to preserve the natural processing site. 



Here and thereafter DNA manipulations and transformation of E. coli were 
15 accomplished according to Maniatis, T., et aL (1989) Molecular Cloning: A 
Laboratory Manual, 2nd edn., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor. Restriction enzymes, mung-bean nuclease, and T4 DNA ligase were 
purchased from Promega (USA). Taq DNA polymerase (Hytest, Finland) was 
used for polymerase chain reaction (PCR) experiments. Nucleotide sequencing 
20 was carried out using the TaqTrack sequencing kit (Promega, USA). Elution of 
DNA fragments from agarose gels was performed with the USBClean kit (USB, 
USA). Following oligonucleotides were used in EXAMPLES I and 2: 

CAF-RI 5'.GGGAATTCAGAGGTAATATATGAAAAAAATC-3' 

IL-PST 5'.CCGCCTGCAGATGCGGCACCTGTACGATCACTG-3' 

CAF-PST 5'- CCGCCTGCAGTTGCAATAGTTCCAAATA-3' 

IL-Primer 5'-AGAACACCACTTGTTGCTCC-3' 

Blunt 5'-TGGAACTATTGCAACTGCAAATGCGGCACCTGTACGA-3' 

3AA S'-GCAACTGCAAATGCGGCAGATTTAGCACCTGTACGATC- 

ACTG-3" 

IL-BamHI 5'-ACCGGATCCACCTCCACCAGATCCACCTCCGGAAGACACA 
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AATTGCATGG-3' 

BamHI- S'-GGTGGATCCGGTGGTGGTGGATCTGCAGATTTAACTGCA 

Caf AGCAC-3' 

Caf-Sall 5'.GCCAAGCTTGTCGACGAGGGTTAGGCTCAAAGT-3' 

SBEKP-1 5'.TCGACAGATCTCGAATTCCGGTACCGGCTGCA-3' 

SBEKP-2 3'-GTCTAGAGCTTAAGGCCATGGCCG.5' 

STOP 5'-GATCATTAATTAAT-3' 

TRC 5'-CCAGATCTGGCAAATATTCTGAAATG-3' 



The genetic constructs coding three fosion proteins '"Cafl-hlL-ip were made 
according to the scheme in Fig. 2. The Eco^-Pstl fragment (about 110 bp) coding 
5 the Can 5'-untranslated region and N-teiminal part of Cafl signal peptide with the 
mutation Asn(-2)Asp was obtains by PGR usmg CAF-RI and IL-Pst 
oligonucleotides and the template plasmid pKM4 (Karlyshev, A.V.. et aL (1992) 
FEES Letters 297, 77-80) followed by digestion of PCRl product with £coRI and 
Pstl. The Pstl-Hindai fragment coding for the C-terminal part of Cafl signal 
,0 peptide joint to the N-termmus of hIL-ip was obtamed by PGR where CAF-PST 
and IL-Primer oUgonucleotides and the template plasmid pPR-TGATG-hlL-ip-tsr 
(Mashko, S.V., et al. (1991) Gene 97, 259-266) were used. The PCR2 product was 
digested with Pstl and Hindm. The two fragments were ligated together with a 
vector fragment pUC19/£coRI-///«dni. The nucleotide structure of the resulted 
15 Ecom-Hinm insert was verified by DNA sequencing. The £coRI-//md]n 
fragment obtained as described and the ///«din-5«i»HI fragment isolated from 
pPR-TGATG-hlL-lp-tsr were ligated together with the EcoRl-Bamm vector 
fragment of pUC19Atfmdni (a derivative of pUC19 where the Hindffl site was 
deleted by filling of sticky ends followed by blunt-end ligation). The £coRI- 
20 Bamm fragment thus obtained encoded the ^''Cafl(.2)hIL-lp fusion protein. The 
point mutation G to A converting the ^ " Cafl (-2)hIL- 1(3 gene into the Cafl-hlL-ip 
gene was made by means of two-step PCR procedure (Landt, O., Grunert. H.-P. 
and Hahn, U. (1990) Gene 96, 125-128) using the BLUNT oligonucleotide and 
two flanking primers (M13 Sequence Primer and IL-Primer) with 
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pUC19A///wUII/"'Cafl(-2)WL-ip plasmid as a template. In the first step, an 
intermediate PGR product was obtained from the mutagenic BLUNT 
oUgonucleotide and the M13 Sequence Primer. After purification from an agarose 
gel, the intermediate PGR product was used with the IL-Primer in the second PGR 
step. The latter PGR product was digested with £coRI and Hindm fbUowed by 
Ugation into the vector fragment made by restriction of the 
pUG19A/?i«dIII/*-'^Gafl(-2)hIL-ip pksmid with the same enzymes. Thus, the 
Ecom-Hinm fragment providing the Asn(-2)Asp mutation in the '■'*Gafl(-2)hIL- 
IP fiision protein was replaced by the corresponding fragment coding the natural 
Gafl signal peptide, and the •'"Gafl-hlL-ip gene was obtained. The 
"'Cafl(+3)hIL-ip gene was constructed in a similar way using the 3AA 
oligonucleotide as a mutagenic primer and pUC19AHi«din/"'Gafl-hIL-ip 
plasmid as a template. In all three hybrid genes the EcoRI-MndDI fragments were 
sequenced to prove corrert nucleotide structures. 

EcdSJ'Banim fragments coding for the fasion proteins were excised from 
plasmids described above and were transferred into pKKmod (a derivative of pKK 
223-3 (Pharmacia) created by deletion of BamHI- and Sa/I-sites in the tet gene 
region and by replacement of the pKK 223-3 polylinker with the pUC18 
polylinker). The expression plasmids pKKmod/^''Gafl(-2)hIL-ip, 
pKKmod/^''Gafl-hIL-ip and pKKmod/^'' Cafl(+3)hIL-lp were thus obtained, and 
E. coli JM105 [F' traD36, /aci* A(lacZ)Ml 5, pro/l^fi' /thi, rp5L{SXx\ endA, sbcB, 
sbcC, hsdR4{x^- mC\ A(lac-proAB)] (NE BioLabs) were transformed with these 
plasmids. 



25 



Expression and processing of ">Cafl-hIL-ip, ^PCafl(-2)hIL-ip. and 
*-PGafl(+3)hIL-lp fusion proteins were monitored by SDS-PAGE electrophoresis 
and immunoblotting. The recombinant E. coli strains were grown to about 0.5 AU 
at 600 nm, PTG (0.5 mM) was added and cells were grown further. Soluble and 
30 insoluble proteins were analysed as separate fractions obtained as follows. After 
certain time intervals, samples of the growth culture with fixed amount of cells 
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were withdrawn, cells were precipitated and washed in 25 mM Tris-Cl, pH 7.5/100 
mM NaCl/1 mM EDTA. Reprecipitated ceUs were suspended in 50 niM H3PO4- 
Tris, pH 6.8 and sonicated using Labsonic U Generator (B. Braun Diessel 
Biotech). The sonicated samples were centrifiiged at 14,000 g for 10 min. 
5 Supernatant contained soluble proteins. Insoluble proteins were extracted from the 
pellet with the SDS-PAGE sample buffer containing 2% SDS and 5% P- 
meicaptoethanol. 

Here and thereafter 10-15% SDS-PAGE of the proteins from both soluble and 
0 insoluble fractions was performed according to the Laemmli procedure in Mini- 
PROTEAN n apparatus (BIO-RAD). Proteins were transferred to a Hybond-C 
membrane (Amersham) by electroblotting in mini-Trans-Blot Electrophoretic 
Transfer CeU (BIO-RAD) foUowed by immunodetection. Results of immunobtots 
were revealed with the ECL kit (Amersham). Polyclonal rabbit antibodies to hlL- 
15 IP (Calbiochem) and monospecific antibodies to Csfl were used for visualisation 
of IL-lp- and Cafl -containing fusion proteins. Rabbit polyclonal antibodies to 
CaflM and mouse policlonal antibodies to CaflA were used for detection CaflM 
and CaflA, correspondingly. The binding of the prin»ary antibodies were 
visualised by rabbit (Calbiochem) and mouse (Amersham) conjugate. 

20 

High expression was observed for all three fusion proteins (about 20-30% of total 
cell protein). However, in all three recombinant strains fractions of soluble 
proteins did not contain significant amounts of recombinant proteins (Fig. 3A and 
B, lanes 1-4). The recombinant hIL-ip protein was discovered in fractions of 
25 insoluble cell proteins (Fig. 3A and B, lanes 6-9). An additional hIL-1 P contaming 
protein was clearly seen in all three constructs. Most probably, it was a product of 
rather specific proteolysis of unprocessed fusion protein accumulated in 
cytoplasm. 

30 The straight fiision appeared not to be processed (Fig. 3A and B, lanes 7). while in 
the other two products the removal of Cafl signal peptides from ML-ip moieties 
achieved considerable levels (Fig. 3A and B, lanes 8,9). 
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To elucidate secretion of processed recombinant hIL-ip proteins, periplasmic 
proteins were separated from cytoplasmic proteins by osmotic shock procedure as 
foUows. Cells precipitated from 10 ml of grovrth medium were suspended in 200 

5 jil 20% (w/v) sucrose/ 0.3 M Tris-HCl, pH 8.0/ 0.5 mM EDTA and kept at room 
temperature for 10 min. Sucrose-treated cells obtained by centrifiigation were 
suspended in ice-cold 10 mM MgCV 0.1 mM PMSF and incubated in ice water 
for 10 min. After centrifi^ation, periplasmic proteins were recovered in 
supernatant fractions. The pellets were suspended in 50 mM H3P04-Tris, pH 6.8 

10 and sonicated followed by centrifiigation at 14,000 g for 10 min. These 
supematants contained soluble cytoplasmic proteins. The activity of the 
cytoplasmic enzyme, glucose 6-phosphate dehydrogenase, was checked as a 
control of the purity of the periplasmic fraction (Naglak T. J. et al. 1990, 12, 603- 
61 1). The obtained samples of the periplasm fraction did not show more than 0.25 

15 % of the cytoplasmic glucose 6-phDsphate dehydrogenase activity. 

Western blot analysis of periplasmic and soluble cytoplasmic proteins (Fig. 4) has 
shown that only minor parts of the processed fusion proteins were found in a 
soluble form in periplasms of ^PCafl(-2)hIL-ip and *PCafl(+3)hIL-ip expression 
20 strains (Fig. 4A, lanes 1,2). No processed protein was detected in periplasm of 
"^P Cafl-hlL-l p expression strain (Fig. 4A, lane 3). Processed hIL-ip proteins were 
absent in cytoplasmic fractions (Fig. 4B). 

The data obtained demonstrated that the ^''Cafl(-2)hIL-ip and 'P Cafl(+3)hIL-ip 
25 fusion proteins were partly processed (in contrast to the Cafl-hlL-ip fusion 
protein). Processed products were secreted into periplasm. However, the majority 
of the processed recombinant WL-l p accumulated in an insoluble form. Moreover, 
this insoluble protein is hidden from Uquid phase of the periplasm since it is not 
digested during trypsin treatment of permeabilized cells (Fig. 5). Variation of 
30 growth temperature and concentration of the inductor did not feciUtate the Cafl 
signal peptide removal and secretion of WL-l p (data not shown). 
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EXAMPLE 2 

Fusion of '•»^Cafl(-2)hIL-lp with the Cafl protein sequence and expression of 
fused protein in the presence of cbaperone and usher proteins. 

5 

The ac (*P-Caf-IL-ip-£af) protein was designed where the *''Cafl(-2)hIL-lp 
amino acid sequence (N-terminus of the fusion) was linked to the Cafl protein 
sequence (C-temmius of the fusion) through a spacer GlyGlyGlyGlySer repeated 
three times. The spacer was inserted to mmimise possible conformational 
10 problems of the two proteins fused in CIC. 

The pCIC expression vector was constructed according to the scheme on Fig. 6. 
The IL part of CIC was obtained by PGR using IL-Pst and IL-BamHI 
oligonucleotides as primere and the pUC19AW«din/ »*Cafl(-2)hIL.lp plasraid as 

15 a template. The Cafl part of CIC was obtained by PGR using BamHI-Cafl and 
Cafl-Sall oUgonucleotides as promers and the pKM4 plasmid as a template. PCR 
products were digested with restrictases as shown in Fig. 3 followed by triple 
ligation with the pUC19Ai/i«dni/"''Cafl(-2)hIL-l|i vector obtained by digestion 
with Pstl and Sail. To produce pCIC, EcomSaU fragment was excised from 

20 pUC19Ai/mdm/CIC plasmid and ligated into the EcoJU-Sali vector obtained from 
pTrc99ANco plasmid (a derivative of pTrc99a (Pharmacia) created by digestion 
with Ncol foUowed by mung-bean nuclease treatment and ligation of blunt ends). 

After IPTG induction, E. coli JM105 cells harbouring pCIC produced the CIC 
25 fiision protein that was successfiiUy processed. A precise removal of the signal 
peptide was proved by N-terminal sequencing of the soluble periplasmic CIC (Fig. 
7). However, only a minor part of the mature H-ip-Cafl protein was extracted by 
an osmotic shock procedure (Fig. 8B, lane I). Main part of the protein was fomid 
in the membrane fraction (data not shown), similar to mature HL-l p. 

30 
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Expression/secretion of QC in the presence of the chaperone (CaflM) and the 
usher (CaflA) was studied in two systems, a) cqflm and Cqfla genes were 
situated on a pACYC plasmid compatible with the pTrc99 derivative coding for 
CIC. b) caflm, ccfla and cic genes together formed an artificial operoa 

5 

In compatible plasmid e>cperiments, E. coli JM105 cells were transformed 
simultaneously with pCIC and pCaflM (a plasmid carrying cqflm gene under tac 
promoter in a pACYC184 derivative, see EXAMPLE 3) to study influence of the 
chaperone on secretion of hIL-l(J fosion protein. Further, pCaflMA, a CaflM- 
CaflA expression/secretion plasmid, was created as folbws. ApaU-ApaU 
fragment containing cqflm and cafla genes under trc promoter was excised from 
the pFMA plasmid (Chapman D., et al. Structural and fonctional significance of 
the FGL sequence of the periplasmic chaperone, CaflM, of Yersinia pestis" J. 
Bacteriology, in press) and Ugated into the vector obtained by digestion of 
pCaflM with ^paU. E. coli JM105 cells were transformed simultaneously with 
pCIC and pCaflMA to obtain co-expression of all three proteins. 

Several plasmids were constructed to study expression and secretion of CIC, 
CaflM and CaflA proteins produced from an artificial operon (Fig. 9). To replace 
the cafl gene with a SBEKP synthetic polylinker, a pMA-link plasmid was 
obtained by triple Ugation of a pFMA/?5rt-5/>eI vector, a Spel-PsA fragment of 
pFMA, and SBEKP-I and SBEKP-2 oligonucleotides annealed together. A 
plasmid pM-link was obtained from the pMA-link plasmid by excision of a Sofl- 
SaR fragment coding for the Cafl A protein followed by self-ligation of the vector. 
A pA-link plasmid was obtained from the pMA-link plasmid by excision of a 
Bamm-Bcmim fragment coding for the C-terminal part of CaflM protein. To 
interrupt the CaflM translation frame, the stop-codon was inserted by ligation of a 
self-complementary STOP oUgonucleotide into the toffl site. The insertion of 
the STOP oUgonucleotide resulted in the loss of 5amHI site. A fragment coding 
for the CIC protein was excised from pCIC with £ccRI and SaR. cloned in 
pBCSr (Stratagene, USA), and recovered with £coRI and Kpnl. To obtain pMA- 
CIC, pM-CIC. and pA-CIC plasmids. the Ecd^l-Kpril fragment was cloned into 
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corresponding sites of pMA-link, pM-link, and pA-link. A plasmid pM-PrCIC 
differed from pM-CIC by the presence of additional trc promoter before the CIC 
gene, and was obtained as foUows. A DNA fragment coding for the frc-promoter 
and the 5'-region of the etc gene was obtained by PGR using TRC and CAF-Pst 

5 oUgonucleotides as primers and pCIC as a template. The PCR product was 
digested with BgEl and EcoStl followed by ligation of the BglH-Eco^ fragment 
coding for the /rc-promoter into corresponding sites of pM-CIC. pMA-PrCIC and 
pA-PrCIC were obtained by ligation of the Bgia-Kpnl fragment from pM-PrCIC 
into corresponding sites of pMA-link and pA-lmk. E. coli NfM522 [F', proAB, 

10 /ac7'A(lacZ)M15/ supE, thi-1. t^ac-proAB), A(hsdSM-mcrB)5, (tk' mO] 
(Stratagenc, USA) was used as a host strain for plasmids described here, 

Similar resuUs were obtained in both co-expression methods. Simultaneous 
expression of CIC and CaflM significantly increased the concentration of the 
15 mature CIC in the periplasm. The results demonstrated that periplasmic molecular 
chaperone promoted a release of the secreted protein from Ihe inner membrane as 
weU as prevented protein from degradation and unspecific aggregation. 

In the presence of CaflM a significant part of the mature CIC protein was detected 
20 in a periplasmic fraction (Fig. 8B, lanes 3,4,6,7,9,10). Elevated amount of secreted 
and soluble CIC co-expressed with CaflM demonstrates that the periplasmic 
molecular chaperone promotes a release of the secreted protein from an inner 
membrane. The critical role of the Cafl part in the CIC protein in this promotion 
was proved in a similar experiment with a plasmid differed from pCIC by deletion 
25 of one base pair in the spacer. The deletion resulted m the frame shift of the Caf 
part of the fiised gene. No fiicilitation of hIL-1 p secretion was observed when ceUs 
harboured the mutated plasmid (data not shown). 

CaflM prevented CIC from degradation. As shown ih Fig. 8B, lane 1. the mature 
30 CIC protein in the periplasmic fraction rapidly degraded into a truncated form with 
a molecular weight about 20-23 kDa. In the presence of CaflM the truncated form 
disappeared (Fig. 8B, lanes 3,4.6,7,10). The amount of the degraded form 
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25 

correlated with the amount of CaflM. For example, when CaflM was expressed 
from pCaflM at a low level the truncated form was found (Fig. 8B, lane 9). 
However, the CaflM amount was sufficient to fecilitate a release of the fiision 
protein from the membrane. The specific digestion of the mature CIC occurred at a 
5 site in the Cafl part of the fusion protein since the truncated form was well 
detected by the IL antibodies but not detected by the Cafl antibodies (data not 
shown). Most probably the cleavage was due to the action of protease DegP, which 
had been shown is induced by unfolded secreted capsular or pilus subunils and 
cleaves them (Soto, G.E., et al. (1998) EMBO J., 17, 6155-6167). 
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The hIL-1 p part of QC secreted in the presence of CaflM was correctly folded. 
The mature CIC protein was detected with monoclonal antibodies to WL-l p in 
ELISA (Fig. 10). ELISA was performed using monoclonal mouse antibodies to 
hIL-lp (HyTest, Finland) as it was described previously (Zav'yalov, V., et aL 
15 (1997) Biochem. J., 324, 571-578). 

ac was excreted onto a cell surface when expressed with both CaflM and Cafl A. 
The presence of the molecular chaperone and the usher protein in cells 
transformed v«th pMA-CIC or with pCIC and pCaflMA together was proved by 

20 immunoblotting with anti-CaflM and anti-Cafl A antibodies (data not shown). To 
prove the excretion of CIC onto a cell surfece, cell agglutination experiments were 
performed with reticulocyte monoclonal diagnosticum for detection of Yersinia 
pestis (Middle Asian Reaserch Institute, USSR). The E. coli cells expressed CIC, 
CaflM, CaflA were able to precipitate reticulocytes with surface bound 

25 monoclonal antibody to Cafl at the concentration about lO' cells/ml. Moreover, 
according to ELISA performed with anti-hlL-ip monoclonal antibodies, a small 
amount of hIL-1 P was detected in cultural medium. 



EXAMPLES 

30 Co-expression of hGMCSF-Cafl fusion protein and the chaperone 



wo 00/66756 PCT/FIOO/00387 

26 

The expression plasmid for the hGM-CSF-Cafl fusion protein was based on 
pFGM13 described in: Petrovskaya L.E. et al. (1995) Russian Journal of Bio- 
organic Chemistry, 21, 785-791. The pFGM13 plasmid contained a synthetic gene 
coding for hGMCSF with Cafl signal sequence (scafl) cloned into Hindm and 

5 Xbal sites of pUC19. Transcription was controlled by the lac promoter and was 
inducible with IPTG. Translation of the scqfl-gmcsf geae in pFGM13 initiated at 
the first methionine codon odacZ gene and utilises its Shine-Dalgamo sequence. 
A primary structure of the signal sequence (scafl) encoded by this gene differed 
from wild-type one at its N-terminus, which contained seven extra residues from 

10 p-galactosidase N-terminus, 

The construction of the pFGMFlplasmid coding for the gmcsf-cqfl gene is shown 
in Fig. 11. The scqfl-gmcsf gene of pFGM13 was modified for the foUowing 
cloning of the fragment from pCIC containing a spacer (4GlySer)3 and the Cafl 

15 coding region. The Kpnll site was introduced at 3'-terminus of the gmcsf gene by 
PGR using two primers (5'ATCGGAAATGTTCGACCTTCAAG and 
5'ATTATTCCGGACTCCTGCACTGGTTCCCAGC) and pFGM13 as a 
template. Pfu DNA Polymerase, was used for a maximal fidelity. The PGR 
fragment was treated with Kpnll foUowed by ligation into the EcoRV-SalGl large 

20 fragment of pFGM13 together with Kpn2l-SalGl fragment from pCIC. The final 
plasmid (pFGMFl) contained the gene encoding a hybrid precursor consisting of 
the scafl signal sequence, GMCSF with 2 N-terminal amino acid changes 
(AlazProa to Asp), a Ser(4GlySer)3 spacer, and Cafl. The plasmid structure was 
confirmed by restriction analysis and sequencing of amplified regions. 

25 

Expression of the hybrid precursor gene in JMlOl E. eoli cells harbouring 
pFGMFl was induced with 0.2 mM IPTG after ceU culture reached an optical 
density 0,5-0,8. The growth with PTG continued for 3 h. Cells from 1 ml were 
collected by centrifiigation and the pellet (total cell protein sample) was analysed 
30 by SDS-PAGE electrophoresis. A fraction of periplasmic proteins was isolated by 
the cold osmotic shock procedure (see EXAMPLE 1) from the cell pellet obtained 
from the rest of the culture. 
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SDS-PAGE analysis of the total ceU protein sample from E. coli cells harbouring 
pFGMFl revealed the presence of a large amount of the fusion protein with 
molecular mass corresponding to the hGMCSF-Cafl fiision protein (31 kDa). 
5 When the cells were destroyed by sonication this protein was localised in an 
insoluble fraction (Fig. 12, lane 3, 4). 

Examination of periplasmic extracts by SDS-PAGE and Western blotting has 
shown that some hGMCSF-Cafl fiision was translocated to the periplasm. Its 
electrophoretic mobUity was the same as of the insohible proteia The fusion 
interacts with both anti-hGMCSF and anti-Cafl polyclonal antibodies (Fig. 13B, 
lanes 1; 14C, lane 1). The protein with molecular weight about 18 kDa was also 
detected with anti-hGMCSF antibodies (Fig. 13B. lane 4). We can presume that 
this protein is the product of hGMCSF-Cafl fusion proteolytic degradation. 
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15 



To study influence of the CaflM chaperone on ejqjression of the gmcsf-cqfl gaie, 
the caflm gene was inserted into the pACYC-trx, a vector with a low copy number 
and compatible with pBR322-based plasmids. PACYC-trx was pACYC184 
derivative that contained trx gene under the control of tac promoter. The Trx 

20 coding region, flanked with Kpnl and ^/w44I sites, was replaced by the DNA 
fragment coding for the CaflM chaperone as follows. The caflm gene was 
amplified by PCR which included Pfu DNA Polymerase, two primers 
(GTTGTCGGTACCATTCCGTAAGGAGG and 5'- 

GTTAACGTGCACACAGGAACAGC) and the pFS2 plasmid (Galyov EE et aL 

25 (1990) FEES Letters, 277, 230-232). The PCR fragment was treated with Kpnl and 
^/m44I and cloned into the Kpnl-AlyAAl large fragment of pACYC-trx. The 
plasmid obtained was designated as pCaflM. 

JMlOl E. coli cells were transformed with both pFGMFl and pCaflM plasmids 
30 followed by analysis of recombinant ceU proteins as described above. Co- 
expression of the chaperone gene from the pCaflM plasmid led to a marked 
increase in the amount of the full-length hGMCSF-Cafl fiision protein in the 
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periplasm (Fig- 13B, lane 3; 13C, lane 2). The amount of 18 kDa protein, the 
product of hGMCSF-Cafl fiision proteolytic degradation, decreased. These results 
demonstrate that CaflM promotes the correct folding of hGMCSF-Cafl and 
enhances the stability of the fusion protein. 

5 

EXAMPLE 4 

Co-expression of hlLlra-Cafl fusion protein and thechaperone 

Based on pFGM13, the pFRA275 plasnrid was obtained. In this plasmid the 5- 
10 terminal part of the mature WL-lra coding region containd AlaAspAsp-coding 
sequence instead of the first Arg codon in order to neutralise the N-terminal 
positive charge of WLlra. The lac promoter controlled the expression of scaf- 
hillra gene in pFRA275. 

15 The expression plasmid pFRF275 coding for the scaf-hiUra-cqfl gene was 
constructed as shown in Fig. 11. At the 3' terminus of the hillra gene the Kpr,2l 
site was introduced by PGR of the pFRA275 plasmid with primers 5'- 
GGAATCCATGGAGGGAAGAT and 5'- 

ATTATTCCGGACTCGTCCTCCTGAAAGTAG. The amplified fragment was 

20 cut with Ncol and Kpnll and Ugated with the Hincail- Kpnll large fragment from 
pFGMFl together with the Hindm- Ncol fragment from pFRA275. The resulting 
plasmid (PFRF275) contained a gene which encodes hybrid precursor consisting of 
the scaf signal sequence, hll-lra with amino acid changes mentioned above, a 
Ser(4GlySer)3 spacer, and Cafl. Plasmid structure was confirmed by restriction 

25 analysis and sequencing of amplified regions. 

Expression of the scaf-hillra-cafl gene in JMlOl E. coli cells was analysed as 
described in EXAMPLE 3. When cells were transformed with pFRF275 alone, the 
hlLlra fiision protein accumulated mainly in an insoluble form (35 kDa, Fig.l2, 
30 lane 1, 2). However, some part of the fusion protein was translocated to the 
periplasm, where it was subjected to degradation (Fig. 14B, lane 1). When 
PFRF275 and pCaflM (see EXAMPLE 3) were simultaneously present in 
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recombinant ceUs, CaflM noticeably diminished the amount of cleavage products 
(Fig. 14B, lane 2). 
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CLAIMS 



1. A bacterial strain simultaneously expressing the fusion protein (signal peptide)- 
(mature heterologous protein)-(subunit of a bacterial surface structure) and a 

5 periplasmic chaperone specific for the subunit for secretion of a recombinant 
heterologous protein into periplasm of bacteria. 

2. A bacterial strain according to Claim 1 additionally expressing outer membrane 
usher or secretin protein specific for said subunit. for the purpose of secretion of a 
soluble recombfaiant heterologous protein on an outer surfece of the bacterium or 

10 into cultivation medium of the bacterixmi. 

3. A bacterial strain according to Claim 1 or 2, wherein any parts of the fiision 
protein, periplasmic chaperone and/or outer membrane usher and/or secretin 
protein are genetically modified. 

4. A bacterial strain according to Claim 1, 2 or 3, wherein, instead of the fiision 
15 protein, a subunit of a bacterial surfece structure with inserted heterologous amino 

acid sequence(s) is expressed. 

5. Use of the bacterial strains according to Claim 1, 2 or 3 for producing 
heterologous recombinant proteins. 

6. Use of the bacterial strains according to Claim 1, 2 or 3, wherein the protein to 
20 be expressed is GMCSF, IL- 1 p, or IL- 1 receptor antagonist. 

7. Bacterial strain, according to Claim 1, 2 or 3, wherein the microbe is 
Escherichia coli. 

8. A microbial expression of recombinant heterologous proteins according to 
Claim 1, 2 or 3 involving the expression con^onents of Cafl from Yersinia pestis. 

25 9. A gene-ei^ineered Escherichia coli strain containing any of the expression 
systems of Claim 1. 2 or 3. 
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